An evaluative baseline for geo-semantic relatedness and similarity

نویسندگان

  • Andrea Ballatore
  • Michela Bertolotto
  • David C. Wilson
چکیده

In geographic information science and semantics, the computation of semantic similarity is widely recognised as key to supporting a vast number of tasks in information integration and retrieval. By contrast, the role of geosemantic relatedness has been largely ignored. In natural language processing, semantic relatedness is often confused with the more specific semantic similarity. In this article, we discuss a notion of geo-semantic relatedness based on Lehrer’s semantic fields, and we compare it with geo-semantic similarity. We then describe and validate the Geo Relatedness and Similarity Dataset (GeReSiD), a new open dataset designed to evaluate computational measures of geo-semantic relatedness and similarity. This dataset is larger than existing datasets of this kind, and includes 97 geographic terms combined into 50 term pairs rated by 203 human subjects. GeReSiD is available online and can be used as an evaluation baseline to determine empirically to what degree a given computational model approximates geo-semantic relatedness and similarity.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Presentation of an efficient automatic short answer grading model based on combination of pseudo relevance feedback and semantic relatedness measures

Automatic short answer grading (ASAG) is the automated process of assessing answers based on natural language using computation methods and machine learning algorithms. Development of large-scale smart education systems on one hand and the importance of assessment as a key factor in the learning process and its confronted challenges, on the other hand, have significantly increased the need for ...

متن کامل

Presentation of an efficient automatic short answer grading model based on combination of pseudo relevance feedback and semantic relatedness measures

Automatic short answer grading (ASAG) is the automated process of assessing answers based on natural language using computation methods and machine learning algorithms. Development of large-scale smart education systems on one hand and the importance of assessment as a key factor in the learning process and its confronted challenges, on the other hand, have significantly increased the need for ...

متن کامل

TCDSCSS: Dimensionality Reduction to Evaluate Texts of Varying Lengths - an IR Approach

This paper provides system description of the cross-level semantic similarity task for the SEMEVAL-2014 workshop. Crosslevel semantic similarity measures the degree of relatedness between texts of varying lengths such as Paragraph to Sentence and Sentence to Phrase. Latent Semantic Analysis was used to evaluate the cross-level semantic relatedness between the texts to achieve above baseline sco...

متن کامل

User Interface Design for Semantic Query Expansion in Geo- data Repositories

Semantic query expansion is the process of supplementing a user query with additional terms that interpret and extend the user's information needs. This work presents the results of an empirical study that investigates user preferences for different designs of user interfaces that provide semantic query expansion for data search from geo-data repositories. The study assesses further whether it ...

متن کامل

Hindi Semantic Category Labeling Using Semantic Relatedness Measures

In this paper, we evaluate and compare six semantic relatedness measures used for Hindi semantic category labeling. Our experiments show that the measure “adapted lesk” performed better than other measures. However, a simple baseline system achieved better accuracy than all the measures.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • GeoInformatica

دوره 18  شماره 

صفحات  -

تاریخ انتشار 2014